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47 
CLAIMS 

1 . A method of operating on a. text comprising a plurality 
of text units, each comprising one or more strings, the 

5 method being characterised by: 

forming a structure for each of at least some of said 
strings, in which structure a string is associated with 
each text unit in which the string occurs? 

10 

for each text unit summing the number of occurrences of 
each other text unit in the same structure or structures 
so as to form an individual score for each pair of text 
units ; and 

15 

processing said individual scores for each text unit in 
order to form a final score for each text unit . 

2 . A method of operating on a text as claimed in claim 1 , 
20 which includes the further step of ranking the text units 

on the basis of said Individual scores. 

3 . A method of operating on a text as claimed in 
claim 1, wherein said text units are sentences, said 
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strings are words forming said sentences , and the method 
comprises the additional, steps off removing stop-words, 
stemming each remaining word and indexing the sentences 
prior to carrying out said summing step, and wherein said 
5 structures are stem-index records each comprising a 
stemmed word and one or. more indexes corresponding to 
sentences in which said stemmed word occurs . 

4, A method of operating on a text as claimed in claim 1, 
10 wherein said text is associated with a word text comprising 

words, each ward being associated with one or more subject 
codes representing subjects with which said word is 
associated, and wherein said strings are subject codes 
associated with said words . 

15 

5. A method of operating on a text as claimed in claim 4, 
which comprises the further step of keeping a record of 
the word spelling associated with each occurrence of a 
subject code in a text unit, and wherein during said 

20 summing step occurrences of the same subject code in a 
pair of text units are disregarded if the same word 
spelling is associated with said same subject code in said 
pair of text units - 
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6 . A method of operating on a text as claimed in 
claim 5. wherein said step of disregarding occur- 
rences of subject codes is not carried, out for subject 
codes which relate to only a single word spelling in the 

5 word text . 

7. A method of operating on a text as claimed in 
claim 1, wherein said processing step includes 
calculating a level for each text unit, in addition 

10 to said final score, and wherein said level Indicates 
the value of the highest of said individual scores in 
relation to a threshold value, 

8 . A storage medium containing a program for controlling 
15 a programmable data processor (70) to perform a method 
as claimed in claim .1 . 

9. A system for ranking text units in a text, the system 
comprising a data processor (70) programmed to perform 
20 the steps of the method of claim 1 . 



